A Coreference Resolution Approach using Morphological Features in Arabic

نویسندگان

  • Majdi Beseiso
  • Abdulkareem Al-Alwani
چکیده

Coreference resolution is considered one of the challenges in natural language processing. It is an important task that includes determining which pronouns are referring to which entities. Most of the earlier approaches for coreference resolution are rule-based or machine learning approaches. However, these types of approaches have many limitations especially with Arabic language. In this paper, a different approach to coreference resolution is presented. The approach uses morphological features and dependency trees instead. It has fivestages, which overcomes the limitations of using annotated datasets for learning or a set of rules. The approach was evaluatedusing our own customized annotated dataset and “AnATAr” dataset. The evaluation show encouraging results with average F1 score of 89%. Keywords—Coreference resolution; Anaphora; Alternative Approach; Arabic NLP; morphological features

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Corefrence resolution with deep learning in the Persian Labnguage

Coreference resolution is an advanced issue in natural language processing. Nowadays, due to the extension of social networks, TV channels, news agencies, the Internet, etc. in human life, reading all the contents, analyzing them, and finding a relation between them require time and cost. In the present era, text analysis is performed using various natural language processing techniques, one ...

متن کامل

Corpus based coreference resolution for Farsi text

"Coreference resolution" or "finding all expressions that refer to the same entity" in a text, is one of the important requirements in natural language processing. Two words are coreference when both refer to a single entity in the text or the real world. So the main task of coreference resolution systems is to identify terms that refer to a unique entity. A coreference resolution tool could be...

متن کامل

Multi-Lingual Coreference Resolution With Syntactic Features

In this paper, we study the impact of a group of features extracted automatically from machine-generated parse trees on coreference resolution. One focus is on designing syntactic features using the binding theory as the guideline to improve pronoun resolution, although linguistic phenomenon such as apposition is also modeled. These features are applied to the Arabic, Chinese and English corefe...

متن کامل

Coreference Resolution of Named Entities and Noun Phrases in Web Pages

An approach for intra-document coreference resolution of named entities and noun phrases is proposed. This approach is a knowledgepoor, integrated approach to coreference resolution which relies on syntactic, discourse and semantic information (using WordNet). Our approach is also intended to exploit the structural features of web pages for the purposes of discourse analysis. This research is i...

متن کامل

Machine Learning for Mention Head Detection in Multilingual Coreference Resolution

This work introduces a machine learning approach to the identification of mention heads needed for multilingual coreference resolution (MCR). We evaluate the method and compare it to a heuristic baseline and a rule-based approach, which are widely used in coreference resolution systems. We use the CoNLL-2012 shared task data sets, which include data for Arabic, Chinese, and English. We show tha...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2016